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Introduction to the CEA family: structure, function and 
secretion 

S. VON K LEI ST 

Institute oflmmunobiology. University of Freiburg, Medical Faculty, Freiburg - Germany 

ABSTRACT: Due to the phenomenal progress in the field of tumor immunology that took place 
during the last twenty years, we dispose today of highly specific and sensitive techniques and reagents 
like monoclonal antibodies (MAbs). In this context the discovery in human carcinomas of tumor- 
associated antigens, such as CEA , was of primary importance, especially since the latter was found to 
have clinical relevance as a tumor marker. Based on animal models, a new in vivo technology for the 
detection of tumors and metastases was developed in recent years, that uses anti-CEA MAbs, or 
fragments of them, coupled to radio-isotopes. This technique, called radio-immunodetection 
(RAID), also paved the way for immunotherapeutic procedures, where again CEA served as the 
target-antigen. This new technique holds great promise, provided the epitope-specificity of the M Abs 
is well-controlled: it has been shown that CEA belongs to a large gene-family of at least 22 members, 
which can be subdivided into two subgroups 'i.e.. the CEA- and the PSG -subgroup) and which in turn 
belongs to the immunoglobulin- super gene family. Great structural similarities render the distinction 
of the various cross-reactive molecules by immunological means rather difficult, (hit J Biol Markers. 
1992; 7; 132-6) 

KEY WORDS: LEA gene, Structure, Molecular biology 



At present the exact characterization of tumor anti- 
gens is one of the most important and most active fields 
of experimental tumor immunology. It is virtually the 
basis of all investigations concerning the possible role 
such substances may play in immune surveillance, im- 
mune recognition and defense mechanisms of malign- 
ant cells in clinical oncology, in immunodiagnosis, and 
immunotherapy. 

Since results that have been obtained with virus- or 
chemically-induced mouse tumors are not even valid 
for all strains of mice and much less for human tumors, 
the discovery of the carcinoembryonic antigen (CEA) 
in man cannot be overemphasized. 

Indeed, little more than 25 years ago two groups. 
Gold and Freedman in Canada and von Kleist and 
Burtin in France, described independently the presence 
of this tumor antigen in colonic carcinomas and their 
metastases (1,2). It was called carcinoembryonic anti- 
gen because it was also shown to be present in the 
homologous embryonic tissues. The name is strictu 
senso incorrect, because CEA is produced not only 
during the first trimester of gestation, but throughout 
pre-natal life and even after birth. When it was shown 
that CEA circulated in the blood of cancer patients, the 
hardly characterized antigen was immediately tried for 



its usefulness in clinical oncology, which was rapidly 
proven. So this antigen became world-wide known as 
the first truly useful tumor marker, before anyone had 
the time or possibility to study and define its com- 
plicated molecular nature. 

Although Gold and Freedman in their first paper in 
1965 already described the presence of car- 
cinoembryonic antigens (1), they were quoted the?e- 
after of having discovered a CEA, and it took almost a 
decade before it was generally accepted that CEA was 
not an entity, but rather a member of a heterogeneous 
family of more or less closely related cross-reactive 
glycoproteins, which could not be clearly distinguished 
by the technical means then currently employed, for 
instance the double-immunodiffusion or Immuno- 
electrophoresis with monospecific antisera or the char 
acterization of physico -chemical properties. 

Table I summarizes the properties thought essentia i 
at that time for defining the CEA, and any human 
glycoprotein fulfilling these criteria was called CEA. 
When put in a test, a radioimmmuno-assay or en/y- 
moimmuno assay, this CEA worked beautifully as a 
tumor marker in clinical hands and this, as everybody 
knows, till the present day. However, very soon it was 
obvious that these criteria were non-essential, becauso 
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TABLE I - PHYSICO-CHEMICAL PROPERTIES OI (iA 



MW of 180.000 i 20.000 D 

beta dcctrophorclic mobility in neutral pH 

ipof.V75 i 0.25 

soluble in perchloric acid 

soluble in 50% saturated ammonium sulphate 

unsoluble in cthanol 

heat stable 

main amino acid: asparaginc/aspartic acid 
main sugar* N-acetylglycosamine 
single polypeptide chain 



they were shared by many CEA-related substances. In 
1972 we described the first, most important of these 
cross-reactive antigens (3), which received the rather 
uncomitting name of non-specific cross-reacting anti- 
gen or NCA, a term which has now been generally 
accepted for this group of molecules, several of which 
have been described under various denominations, but 
were later found to be either identical to the NCAs or to 
belong to the CEA family as separate members. 

Since the cross-reactions were first observed when 
monospecific but polyclonal antisera were employed, it 
was logic to explain them by crossreactions with CEA- 
like compounds and it was hoped to define specifically 
the CEA once these cross-reactions were eliminated by 
the development and use of CEA-rcactive monoclonal 
antibodies. When this was not the case, it became quite 
evident that CEA not only was an almost ubiquitous, 
heterogeneous molecule, but also had very few specific 
and many shared epitopes. 

It took another 1 5 years before the successful clon- 
ing of the genes and the characterization of cDNAs 
showed that the CEA-gene represents the prototype of 
a large gene-family, of which so far more than 30 
members have been identified and there are indi- 
cations that there are many more (4). 

In 1989 the CEA gene-family has been officially 
reconstituted at an ISOBM-Workshop (5): according 
to consistent amino-acid sequence similarities it is di- 
vided into two subgroups: the CEA- and the PSG- 
subgroup, which are depicted in Table II. 

The first subgroup comprises the genes encoding 
for CEA, which exists in a 180 kD and a 160 kD form 
and the genes of the other classical crossreacting anti- 
gens, like the NCA and the biliary glycoprotein BGPE 
first described in human bile by Svenberg as a heavily 
glycosylated 85 kD protein, of which probably more 
than 10 different molecules exist (6). There are some 
further CEA family members (abbreviated by the pre- 
fix CGM). namely the COM 6, of which the gene pro- 
duct most probably is the NCA95. first described by 



TABLF II - CLASSIFICATION OF THE HUMAN ( FA GENE FA 
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Old gene or dune name 


New gene or 


rnRNA name 


I CFA subgruup 


CFA 


CEAa 




CEAb 


NCA 


NCA 


hsCGMfi, Mf>, NCA-W272 


CGM 6 


BGPLTM-I CFA 


BGPa 


TM-2 CEA 


BGPb 


TM-3CEA 


BGPc 


TM-4 CFA 


BGPd 


4-22 


BGPe 


4-13 


BGPf 


hsCGMl 


CGM 1 


hsCGM2 5 EH. 7E12 


CGM 2 


2 PSG subgroup 


PSBG 


PSGl 


PSG«, PSBG-D, hPSPl 1. FL-NCA-2, hPS3, PSG la 


PSG la 


PSG 16 


PSG lb 


PSBG-C 


PSGlc 


FL-NCA-I, PSGld, SG9 


PSGld 


PSBG -Ci. PSG95 


PSGIe 


PSBG-D' 


PSGlf 


PSBG-E. SG8 


PSG2n 


pSPl-i, hcl7, PS35, hTS16, PSG A, SG5 


PSG3m 


PSG4. hsCGM4, hHSP2, 


PSG4 


PSG 9 (PS^) 


PSG4a 


PSG5, PSBG HL, FL-NCA-3 


PSGS 




PSG5n 




PSG.Sm 


hsC(,M3. PSGGB 


PSG6 


PSG6 


PSGor 


hPSH, PSG 10 


PSGbs 


PSG 7 


PSG 7 


CGM V5 


PSG 8 


hTSl 


PSGHa 


PS34. hPS2, PSBG B 


PSG 1 1 s 




PSGUw 
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Buchegger et al. in 1984 and found in leukocytes and 
chronic myeloid leukemia (7), the COM 1 , CGM2, and 
CGM7-9, for which the gene products have yet to be 
found. Another member of the CEA-family, a CEA 
cross-reacting antigen of 140 kD expressed on human 
lymphoid cell-lines, has only recently been discovered 
by Kuroki et ah (8). The size of the CEA gene-family as 
estimated from Southern analyses of total genome 
DNA indicated the existence of 10-13 genes; however, 
this number may change (4). 

Only a few years ago it was shown that the pregnan- 
cy-specific glycoproteins (PSGs) genetically belong 
also to the CEA gene-family. This subgroup consists of 
at least 1 1 , but most likely many more members, which 
according to the new nomenclature are numbered 
PSG1-PSG1L Several mRNA-products of PSG are 
differentially spliced (they are designated with a small 
letter, e.g. PSG la, PSG lb etc.), which leads to many 
protein variants (9). 

The pregnancy-specific glycoprotein 1 (PSG1 ) and 
SP-1, i.e. Schvvangerschaftsprotein !, were again de- 
scribed independently by two groups, i.e. Tatarinov 
and Masyukevich and Hans Bohn (10, 11). The PSGs 
are a group of very heterogeneous glycoproteins, which 
under physiological conditions are predominantly ex- 
pressed by the placenta without, however, being ex- 
clusively specific for this tissue, since PSG has also been 
reported in testicular tissues, the salivary glands, and 
fetal liver (12). In malignant lesions PSGs are found 
mainly in trophoblastic tumors, however, in a small 
percentage of cases it has also been found in tissues of 
breast and colonic carcinomas and multiple myelomas 
as well as in a leukemia cell-line HL60 and in the sera of 
patients with various gynecological carcinomas (13). 

The various members of one subgroup show a great 
similarity at the nucleotide level of about 89-95%, 
however, a significantly lower similarity of only 66- 
70% is seen between the two subgroups (14). 

The molecular structure of the different members 
of the CEA geneTamily up to a certain degree follows a 
similar pattern with 4 distinguishable regions: 

1) All members of the gene-family contain a hydro- 
phobic leader peptide (L) of 34 amino- acids, which is 
cleaved off and therefore missing in the mature protein. 

2) It is followed by the N-tcrrninal domain, which for 
the members of the CEA subgroup comprises 1 08 AAs, 
for the second subgroup PSG 1-1 1 108-110 AAs. This 
characteristic domain for the CEA gene-family is void 
of cysteines, but was found nevertheless to resemble the 
variable region of immunoglobulins, for which reason 
the N-domain is also called IgV-like domain (15). 

3 ) Following this domain the various members show a 



different number of repeats, for CEA A 1 -A3, B1-B3, of 
which only the first (Al, Bl) is with 178 amino-acids 
alike for all members of the CEA subgroup. In contrast 
to the N-terminal domain, the repeated A- and B- 
domains contain cysteines, which in position and dis- 
tance from each other resemble those of the constant 
(C) region of the heavy chains of the immunoglobulins. 
These regions are, therefore, called IgC-like domains 
(15). 

4) Some members of the CEA-subgroup, of which the 
sequence is known and with the exception of BGP have 
a membrane-associated (M) carboxyterminal domain, 
which consists mostly of hydrophobic aminoacids, 
which are, however, cleaved off and are replaced by a 
glycosylphosphatidylinosytol (GPi), a so-called pick 
tail structure, which anchors the protein in the cell 
membrane (16). 

BGP is a classical transmembrane protein with a 
carboxy-terminal region consisting of a transmem- 
brane domain of 43 hydrophobic aminoacids, followed 
by a hydrophiiic cytoplasmic region, which according 
to the different splice variants, is made up of a varying 
number of aminoacids (L7). 

All members of the PSG-subgroup have relatively 
short C-domains and this together with a lack of the Mi- 
domain, which is characteristic for these molecules, has 
probably also f unctional consequences in respect to the 
secretion. Most PSGs are easily secreted and it is as yet 
unknown whether there are PSG-variants, which, like 
CEA or NCA, are membrane-bound via a GPI- 
structure (18). 

Ail genes of the two subgroups of the CEA gene- 
family are localized as clusters on the long arm of 
chromosome 1 9 at position q 1 3. 1 -q 3 3 J, possibly close 
to the genes coding for the TGF-B1 and LIPE (19). 

The finding, that in the CEA the A- and B-domains 
are combined to form 3 identical repeats, which with 
the N-domain were found to be very similar to the 
immunoglobulin variable and constant domains, re- 
spectively, suggested, that CEA and its related 
molecules belong to the Ig-supergene family (20), to 
which belong among others such well-known 
molecules as the T-cell receptor, the platelet-derived 
growth factor receptor, the MHC-class I and class 11 
antigens, the N-CAM, etc. The functions of these vari- 
ous members have as common trait in that they are 
well-characterized and serve as ccll-cell-interaction-, 
recognition-, or cell adhesion molecules. This for the 
first time gave a hint as to the possible function of 
CEA-famify members and led to several in-vitro 
studies, which so far seem to show that CEA might. 
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indeed, play a role in cell-cell-adhesion mechanisms 

(21) . This will be discussed in the next paper. 

In clinical oncology it is a well-known fact that 
CEA is present more often in tissues than in sera of 
tumor-bearing patients; the reason of this discrepancy 
is not known. By xenotransplantation experiments in 
nude mice we showed that it must be an intrinsic factor 
of the tumors, whether or not CEA is released into the 
circulation; there were CEA-positive tumors which did 
not release it in humans, but once transplanted secreted 
CEA immediately, and there were others that grew and 
produced CEA, but did not release it in the nude mouse 

(22) . The mechanism of secretion is not yet known but 
actively studied. It may well be that a GPI-specific 
phospholipase D is involved: this enzyme is present in 



human sera and granulocytes. However, this has yet to 
be proved (23, 24). 

Apparently it is still a long way to elucidate the 
function and role of CEA, which after all is an em- 
bryonic antigen first, before becoming a tumor antigen 
and its biological role should be compatible for the 
physiological as well as the pathological state. 
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